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3 <110> APPLICANT: Meng , Xiang-Jin 

4 Haqshenas, Gholamreza 

5 Huang, Fang -Fang 

7 <120> TITLE OF INVENTION: Avian Hepatitis E Virus, Vaccines and Methods of Protecting 
Against Avian 

8 Hepatitis-Spenomegaly Syndrome and Mammalian Hepatitis E 
10 <130> FILE REFERENCE: AM100389 

12 <140> CURRENT APPLICATION NUMBER: US 10/029,840 

13 <141> CURRENT FILING DATE: 2001-12-31 
15 <160> NUMBER OF SEQ ID NOS : 11 

17 <170> SOFTWARE: Patentln version 3.1 

19 <210> SEQ ID NO: 1 

20 <211> LENGTH: 3946 

21 <212> TYPE: DNA 

22 <213> ORGANISM: Hepatitis E virus 

24 <400> SEQUENCE: 1 

25 accagcattg gatttcgatg gacgctgttt aacgagcgcc gttgatcttg ggttgcagcc 60 
27 taccagctgg cgcaccgtat cccaccgttg cccttgggac gtttgtatat ttttgcgtac 120 
29 tgattatccg actatcacca caaccagtag ggtgctgcgg tctgttgtgt ttaccggtga 180 
31 aaccattggt cagaagatag tgtttaccca ggtggccaag cagtcgaacc ccgggtccat 240 
33 aacggtccat gaggcgcagg gcagtacttt tgatcagact actataatcg ccacgttaga 300 
35 tgctcgtggc cttatagctt catctcgcgc gcatgccata gttgcgctaa cccgccaccg 3 60 
37 ggagcgctgt agtgtgattg atgttggtgg ggtgctggtc gagattggag ttactgatgc 420 
39 catgtttaac aatatcgaaa tgcagcttgt gcgacctgat gctgcagccc ctgccggggt 4 80 
41 gctacgagcc ccagacgaca ccgtggatgg cttgttggac atacccccgg cccacactga 540 
43 tgtagcggcg gtgttaacag ctgaggcgat tgggcatgcg ccccttgaat tggccgccat 600 
45 aaatccaccc gggcctgtat tggagcaggg cctattatac atgccggcca ggcttgatgg 660 
47 gcgtgatgag gttgttaagc tccagctgtc ggatactgta cactgccgcc tggctgcacc 720 
49 cactagccgt cttgcggtga ttaacacatt ggttgggcgg tacggtaaag ccactaagct 780 
51 gcctgaggtt gaatatgact taatggacac tattgcgcag ttctggcatc atatcggacc 840 
53 aatcaacccc tcaacactgg agtatgcaga gatgtgcgag gccatgctta gtaagggcca 900 
55 ggatgggtcc ttgattgtac atctggattt acaggatgct gattgttctc gcataacatt 960 
57 cttccagaag gactgcgcta aatttacgct ggatgaccct gttgcacacg gtaaagtggg 1020 
59 acaggggata tctgcgtggc cgaaaacttt gtgtgcactt ttcggcccct ggttccgggc 1080 
61 tatagagaag caccttgtgg ctgggttacc cccaggttat tactatgggg acctgtacac. 1140 
63 ggaagccgat ctgcatcgtt ctgtgctttg cgcgcctgct ggtcaccttg tttttgagaa 1200 
65 tgatttctca gagtttgact caacgcagaa taatgtgtcc cttgatctcg aatgtgaatt 1260 
67 gatgcgcagg tttgggatgc ccgattggat ggtagccttg taccatcttg ttcgatcata 1320 
69 ctggctcttg gttgccccga aagaagccct tcgtggctgt tggaaaaaac actctggtga 13 80 
71 gccgggcacc cttttgtgga atacagtttg gaacatgact gtgttgcatc atgtttatga 1440 
73 gtttgatcga ccaagtgtgt tgtgtttcaa aggtgatgat agtgtcgttg tctgtgaatc 1500 
75 ggtgcgcgcc cgtccagagg gcgttagtct cgtggcagac tgcgggctaa aaatgaagga 1560 
77 caagaccggc ccgtgtggcg ccttttccaa cctgctgatc ttcccgggag ctggtgttgt 1620 
79 ctgcgacctg ttacggcagt ggggccgctt gactgacaag aactgggggc ccgacattca 1680 



file://C:\Crf3\Outhold\VsrJ029840.htm 



4/12/02 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/029, 840 



DATE: 04/12/2002 
TIME: 14:27:57 



Input Set : A:\EP.txt 

Output Set: N:\CRF3\04122002\J029840.raw 



Page 2 of 7 



81 gcggatgcag gaccttgagc aagcgtgtaa ggattttgtt gcacgtgttg taactcaggg 1740 

83 taaagagatg ttgaccatcc agcttgtggc gggttattat ggtgtggaag ttggtatggt 1800 

85 tgaggtggtt tggggggctt tgaaggcctg cgccgcagcc cgcgagaccc tagtgaccaa 1860 

87 caggttgccg gtactaaact tatctaagga ggactgaaca aataacaatc attatgcagt 1920 

89 ctgcgcgtcc atgtgcctta gctgccagtt ctggtgtttg gagtgccagg aaagtggggt 1980 

91 gggatgtcgc tgtgtagatt gttgctcatg cttgcaatgt gctgcggggt gtcaaggggc 2040 

93 tcccaaacgc tcccagccgg aggcaggcgt ggccagcgcc gccgtgacaa ttcagcccag 2100 

95 tggagcactc aacaacgccc cgagggagcc gtcggccccg cccctctcac agacgttgtc 2160 

97 accgcggcag gtactcgcac ggtaccagat gtagatcaag ccggtgccgt gctggtgcgc 2220 

99 cagtataatc tagtgaccag cccgttaggc ctggccaccc ttggtagcac caatgccttg 2280 

101 ctttatgccg caccggtgtc accgttaatg ccgcttcagg acggcacgac gtctaatatc 2340 

103 atgagcacgg agtctagcaa ctatgctcaa taccgtgtac agggcctaac tgtccgctgg 2400 

105 cgcccagttg tgccaaatgc ggtgggcggc ttctctataa gcatggccta ttggccccag 2460 

107 acaacatcca cccctacaag cattgacatg aattccatca cgtccactga cgtccgtgtg . 2520 

109 gtgcttcagc cgggctctgc tggtttgctg actataccac atgagcgttt ggcgtataag 2580 

111 aacaatggtt ggcggtccgt cgaaacggta tccgtcccac aggaggatgc cacgtccggc 2640 

113 atgctcatgg tttgtgtcca cgggaccccc tggaatagtt ataccaatag tgtttacacc 2700 

115 gggccgcttg gtatggttga ttttgccata aagttacagc taaggaactt gtcgcccggt 2760 

117 aatacaaatg ccagggtcac ccgtgtgaag gtgacggccc cacataccat caaggctgac 2820 

119 ccatctggtg ctaccataac aacagcagct gcggccaggt ttatggcgga tgtgcgttgg 2880 

121 ggcttgggca ctgctgagga tggcgaaatt ggtcacggca tccttggtgt tctgtttaac 2940 

12 3 ctggcggaca cagttttagg tggcttgccc tcgacactgc tgcgggcggc gagtggtcag 3000 

12 5 tacatgtacg gccggcctgt ggggaacgcg aacggcgagc ctgaggtgaa actgtatatg 3060 

127 tcggttgagg atgccgttaa cgataaacct attatggtcc cccatgacat cgacctcggg 3120 

129 accagcactg tcacctgcca ggactatggg aatcagcatg tggatgaccg cccatccccg 3180 

131 gccccggccc ctaagcgagc tttgggcacc ctaaggtcag gggatgtgtt gcgtattact 3240 

133 ggctccatgc agtatgtgac taacgccgag ttgttaccgc agagtgtgtc acaggggtac 3300 

135 tttggggccg gcagcaccat gatggtgcat aatttgatca ctggtgtgcg cgcccccgcc 3360 

137 agttcagtcg actggacgaa ggcaacagtg gatggggtcc aggtgaagac tgtcgatgct 3420 

139 agttctggga gtaataggtt tgcagcgtta cctgcatttg gaaagccagc tgtgtggggg 3480 

141 ccccagggcg ctgggtattt ctaccagtat aacagcaccc accaggagtg gatttatttt 3540 

143 cttcagaatg gtagctccgt ggtttggtat gcatatacta atatgttggg ccagaagtca 3600 

145 gatacatcca ttctttttga ggtccggcca atccaagcta gtgatcagcc ttggtttttg 3660 

147 gcacaccaca ctggcggcga tgactgtacc acctgtctgc ctctggggtt aagaacatgt 3720 

149 tgccgccagg cgccagaaga ccagtcacct gagacgcgcc ggctcctaga ccggcttagt 3780 

151 aggacattcc cctcaccacc ctaatgtcgt ggttttgggg ttttaggttg attttctgta 3840 

153 tctgggcgta attgccccta tgtttaattt attgtgattt ttataactgt tcatttgatt 3900 

155 atttatgaaa tcctcccatc tcgggcatag taaaaaaaaa aaaaaa 3946 

158 <210> SEQ ID NO: 2 

159 <211> LENGTH: 146 

160 <212> TYPE: PRT 

161 <213> ORGANISM: Hepatitis E virus 
163 <400> SEQUENCE: 2 

165 Pro Ala Leu Asp Phe Asp Gly Arg Cys Leu Thr Ser Ala Val Asp Leu 

166 1 5 10 15 

169 Gly Leu Gin Pro Thr Ser Trp Arg Thr Val Ser His Arg Cys Pro Trp 

170 20 25 30 

173 Asp Val Cys lie Phe Leu Arg Thr Asp Tyr Pro Thr lie Thr Thr Thr 

174 35 40 45 
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177 Ser Arg Val Leu Arg Ser Val Val Phe Thr Gly 



178 



50 



55 



181 Lys lie Val Phe Thr Gin Val Ala Lys Gin Ser 



182 65 



70 



75 



185 Thr Val His Glu Ala Gin Gly Ser Thr Phe Asp 



186 



85 



90 



189 Ala Thr Leu Asp Ala Arg Gly Leu lie Ala Ser 



190 



100 



105 



193 lie Val Ala Leu Thr Arg His Arg Glu Arg Cys 



194 



115 



120 



197 Gly Gly Val Leu Val Glu He Gly Val Thr Asp 



198 



135 



130 

201 He Glu 

202 145 

205 <210> SEQ ID NO: 3 

206 <211> LENGTH: 439 

207 <212> TYPE: DNA 

208 <213> ORGANISM: Hepatitis E virus 

210 <400> SEQUENCE: 3 

211 accagcattg gatttcgatg gacgctgttt aacgagcgcc 
213 taccagctgg cgcaccgtat cccaccgttg cccttgggac 
215 tgattatccg actatcacca caaccagtag ggtgctgcgg 
217 aaccattggt cagaagatag tgtttaccca ggtggccaag 
219 aacggtccat gaggcgcagg gcagtacttt tgatcagact 
221 tgctcgtggc cttatagctt catctcgcgc gcatgccata 
223 ggagcgctgt agtgtgattg atgttggtgg ggtgctggtc 
225 catgtttaac aatatcgaa 

228 <210> SEQ ID NO: 4 

229 <211> LENGTH: 483 
<212> TYPE: PRT 

Hepatitis E virus 
4 



Glu Thr He Gly Gin 
60 

Asn Pro Gly Ser He 
80 

Gin Thr Thr He He 
95 

Ser Arg Ala His Ala 
110 

Ser Val He Asp Val 
125 

Ala Met Phe Asn Asn 
140 



gttgatcttg 
gtttgtatat 
tctgttgtgt 
cagtcgaacc 
actataatcg 
gttgcgctaa 
gagattggag 



ggttgcagcc 
ttttgcgtac 
ttaccggtga 
ccgggtccat 
ccacgttaga 
cccgccaccg 
ttactgatgc 



230 



231 <213> ORGANISM: 
233 <400> SEQUENCE: 



236 1 
239 As 
240 

243 Vc 
244 



10 



20 



25 



35 



40 



247 


Leu 


Ala 


Ala 


He 


Asn 


Pro 


Pro 


248 




50 










55 


251 


Tyr 


Met 


Pro 


Ala 


Arg 


Leu 


Asp 


252 


65 










70 




255 


Leu 


Ser 


Asp 


Thr 


Val 


His 


Cys 


256 










85 






259 


Ala 


Val 


He 


Asn 


Thr 


Leu 


Val 


260 








100 








263 


Pro 


Glu 


Val 


Glu 


Tyr 


Asp 


Leu 


264 






115 










267 


His 


He 


Gly 


Pro 


He 


Asn 


Pro 



90 



105 



120 



Gly 


Val 


Leu 


Arg 


Ala 
15 


Pro 


Pro 


Pro 


Ala 


His 
30 


Thr 


Asp 


Gly 


His 


Ala 
45 


Pro 


Leu 


Glu 


Leu 


Glu 
60 


Gin 


Gly 


Leu 


Leu 


Glu 


Val 


Val 


Lys 


Leu 


Gin 


75 










80 


Ala 


Pro 


Thr 


Ser 


Arg 
95 


Leu 


Gly 


Lys 


Ala 


Thr 
110 


Lys 


Leu 


He 


Ala 


Gin 
125 


Phe 


Trp 


His 


Glu 


Tyr 


Ala 


Glu 


Met 


Cys 



60 
120 
180 
240 
300 
360 
420 
439 
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268 




130 










135 


271 


Glu 


Ala 


Met 


Leu 


Ser 


Lys 


Glv 


272 


145 










150 




275 


Asp 


Leu 


Gin 


Asp 


Ala 


Asp 


Cys 


276 










165 






279 


Cys 


Ala 


LVS 


Phe 


Thr 


Leu 


Asp 


280 








180 








283 


Gin 


Glv 


lie 


Ser 


Ala 


TrD 

tr 


Pro 


284 






195 










287 


Trp 


Phe 


Ara 


Ala 


He 


Glu 


Lvs 


288 




210 










215 


291 


Tvt 

x 1 x 


Tvt 


Tvt 


Gly Asp 


Leu 


Tvt 


292 


225 










230 




295 


Leu 


Cys 


Ala 


Pro 


Ala 


Glv 


His 


296 










245 






299 


Phe 


Asp 


Ser 


Thr 


Gin 


Asn 


Asn 


300 








260 








303 


Met 


Arcr 


Arg 


Phe Gly 


Met 


Pro 


304 






275 










307 


Val 


Arg 


Ser 


Tyr 


Trp 


Leu 


Leu 


308 




290 










295 


311 


Cys 


Trp 


Lys 


Lys 


His 


Ser 


Glv 

vjr j_y 


312 


305 














315 


Val 


Tro 


Asn 


Met 


Thr 


Val 


Leu 


316 










325 






319 


Ser 


Val 


Leu 


Cys 


Phe 


Lys 


Glv 


320 








340 








323 


Val 


Arg 


Ala 


Arg 


Pro 


Glu 


Gly 


324 






355 










327 


j-i y o 


Met 


T,v<3 

j-j y o 


Asp 


Lys 


T hi T 

J. 11 J. 


vj J. y 


328 




370 










37S 


331 


lie 


Phe 


Pro 


Gly Ala 


Glv 


Val 


332 


385 










390 




335 


Arg 


Leu 


Thr 


Asp 


Lys 


Asn 


T TO 


336 










405 






339 


Leu 


Glu 


Gin 


Ala 


Cys 


Lys 


Asp 


340 








420 








343 


Lys 


Glu 


Met 


Leu 


Thr 


He 


Gin 


344 






435 










347 


Val 


Gly 


Met 


Val 


Glu 


Val 


Val 


348 




450 










455 


351 


Ala 


Arg 


Glu 


Thr 


Leu 


Val 


Thr 


352 


465 










470 




355 


Lys 


Glu 


Asp 










359 


<210> SEQ ID NO: 


5 






360 


<211> LENGTH: 14 50 






361 


<212> TYPE: 


DNA 








362 


<213> ORGANISM: 


Hepatitis E 


364 


<400> SEQUENCE: 


5 







140 



Gin 


Asp 


Glv 


Ser 


Leu 


He 


Val 


His 


Leu 








155 










160 


Ser 


Arg 


He 


Thr 


Phe 


Phe 


Gin 


Lys 


Asp 






170 










175 




Asp 


Pro 


Val 


Ala 


His 


Glv 


Lys 


Val 


Glv 




185 










190 






Lys 


Thr 


Leu 


Cys 


Ala 


Leu 


Phe 


Glv 


Pro 


200 










205 








His 


Leu 


Val 


Ala 


Glv 

vj j-y 


Leu 


Pro 


Pro 


Glv 










220 










Thr 


Glu 


Ala 


Asp 


Leu 


His 


Arg 


Ser 


Val 








235 










240 


Leu 


Val 


Phe 


Glu 


Asn 


Asp 


Phe 


Ser 


Glu 






250 










255 




Val 


Ser 


Leu 


Asp 


Leu 


Glu 


Cys 


Glu 


Leu 




265 










270 






Asp 


Trp 


Met 


Val 


Ala 


Leu 


Tvr 

± y x 


His 


Leu 


280 










285 








Val 


Ala 


Pro 


Lys 


Glu 


Ala 


Leu 


Arg 


Glv 










300 










Glu 


Pro Gly 


Thr 


Leu 


Leu 


Trn 

IT 


Asn 


Thr 








315 










320 


His 


His 


Val 


Tvr 

± y j. 


Glu 


Phe 


Asp 


Arg 


Pro 






330 










335 




Asp 


Asp 


Ser 


Val 


Val 


Val 


Cys 


Glu 


Ser 




345 










350 






Val 


Ser 


Leu 


Val 


Ala 


Asp 


Cys 


Glv 


Leu 


360 










365 








ir iu 


Cys Gly 


Ala 


php 

c 1 J.C 


C £J -K- 


Aon 
noli 


T on 
•UcU 


Ton 










380 










Val 


Cys 


Asp 


Leu 


Leu 


Arg 


Gin 


Trp 


Gly 








395 










400 


Gly 


Pro 


Asp 


He 


Gin 


Arg 


Met 


Gin 


Asp 






410 










415 




Phe 


Val 


Ala 


Arg 


Val 


Val 


Thr 


Gin 


Gly 




425 










430 






Leu 


Val 


Ala 


Gly 


Tyr 


Tyr 


Gly 


Val 


Glu 


440 










445 








Trp 


Gly Ala 


Leu 


Lys 


Ala 


Cys 


Ala 


Ala 










460 










Asn 


Arg 


Leu 


Pro 


Val 


Leu 


Asn 


Leu 


Ser 








475 










480 



virus 
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365 gcttgtgcga cctgatgctg cagcccctgc cggggtgcta cgagccccag acgacaccgt 60 

3 67 ggatggcttg ttggacatac ccccggccca cactgatgta gcggcggtgt taacagctga 120 

3 69 ggcgattggg catgcgcccc ttgaattggc cgccataaat ccacccgggc ctgtattgga 180 
371 gcagggccta ttatacatgc cggccaggct tgatgggcgt gatgaggttg ttaagctcca 240 
373 gctgtcggat actgtacact gccgcctggc tgcacccact agccgtcttg cggtgattaa 300 
375 cacattggtt gggcggtacg gtaaagccac taagctgcct gaggttgaat atgacttaat 360 
377 ggacactatt gcgcagttct ggcatcatat cggaccaatc aacccctcaa cactggagta 420 
37 9 tgcagagatg tgcgaggcca tgcttagtaa gggccaggat gggtccttga ttgtacatct 4 80 
381 ggatttacag gatgctgatt gttctcgcat aacattcttc cagaaggact gcgctaaatt 540 
383 tacgctggat gaccctgttg cacacggtaa agtgggacag gggatatctg cgtggccgaa 600 
385 aactttgtgt gcacttttcg gcccctggtt ccgggctata gagaagcacc ttgtggctgg 660 
387 gttaccccca ggttattact atggggacct gtacacggaa gccgatctgc atcgttctgt 720 
389 gctttgcgcg cctgctggtc accttgtttt tgagaatgat ttctcagagt ttgactcaac 7 80 
391 gcagaataat gtgtcccttg atctcgaatg tgaattgatg cgcaggtttg ggatgcccga 840 
393 ttggatggta gccttgtacc atcttgttcg atcatactgg ctcttggttg ccccgaaaga 900 
395 agcccttcgt ggctgttgga aaaaacactc tggtgagccg ggcacccttt tgtggaatac 960 
397 agtttggaac atgactgtgt tgcatcatgt ttatgagttt gatcgaccaa gtgtgttgtg 1020 
399 tttcaaaggt gatgatagtg tcgttgtctg tgaatcggtg cgcgcccgtc cagagggcgt 1080 
401 tagtctcgtg gcagactgcg ggctaaaaat gaaggacaag accggcccgt gtggcgcctt 1140 
403 ttccaacctg ctgatcttcc cgggagctgg tgttgtctgc gacctgttac ggcagtgggg 1200 
405 ccgcttgact gacaagaact gggggcccga cattcagcgg atgcaggacc ttgagcaagc 1260 

4 07 gtgtaaggat tttgttgcac gtgttgtaac tcagggtaaa gagatgttga ccatccagct 1320 
409 tgtggcgggt tattatggtg tggaagttgg tatggttgag gtggtttggg gggctttgaa 1380 
411 ggcctgcgcc gcagcccgcg agaccctagt gaccaacagg ttgccggtac taaacttatc 1440 
413 taaggaggac 14 50 

416 <210> SEQ ID NO: 6 

417 <211> LENGTH: 606 

418 <212> TYPE: PRT 

419 <213> ORGANISM: Hepatitis E virus 
421 <400> SEQUENCE: 6 

42 3 Met Ser Leu Cys Arg Leu Leu Leu Met Leu Ala Met Cys 
424 15 10 

427 Ser Arg Gly Ser Gin Thr Leu Pro Ala Gly Gly Arg Arg 

428 20 25 



431 Arg Arg Asp Asn Ser Ala Gin Trp Ser Thr Gin Gin Arg 



Cys Gly Val 
15 

Gly Gin Arg 
30 

Pro Glu Gly 



432 



35 



40 



45 



435 


Ala 


Val 


Gly 


Pro 


Ala 


Pro 


Leu 


Thr 


Asp 


Val 


Val 


Thr 


Ala 


Ala 


Gly 


Thr 


436 




50 










55 










60 










439 


Arg 


Thr 


Val 


Pro 


Asp 


Val 


Asp 


Gin 


Ala 


Gly 


Ala 


Val 


Leu 


Val 


Arg 


Gin 


440 


65 










70 










75 










80 


443 


Tyr 


Asn 


Leu 


Val 


Thr 


Ser 


Pro 


Leu 


Gly 


Leu 


Ala 


Thr 


Leu 


Gly 


Ser 


Thr 


444 










85 










90 










95 




447 


Asn 


Ala 


Leu 


Leu 


Tyr 


Ala 


Ala 


Pro 


Val 


Ser 


Pro 


Leu 


Met 


Pro 


Leu 


Gin 


448 








100 










105 










110 






451 


Asp 


Gly 


Thr 


Thr 


Ser 


Asn 


He 


Met 


Ser 


Thr 


Glu 


Ser 


Ser 


Asn 


Tyr 


Ala 


452 






115 










120 










125 








455 


Gin 


Tyr 


Arg 


Val 


Gin 


Gly 


Leu 


Thr 


Val 


Arg 


Trp 


Arg 


Pro 


Val 


Val 


Pro 


456 




130 










135 










140 










459 


Asn 


Ala 


Val 


Gly 


Gly 


Phe 


Ser 


He 


Ser 


Met 


Ala 


Tyr 


Trp 


Pro 


Gin 


Thr 
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